11. Exploring with Visuals

Exploring with Visuals

Use the notebook below to perform exploratory data analysis on your newly combined dataframe. Create some visuals to answer these quiz questions below the notebook.

  • Based on histograms of columns in this dataset, which of the following feature variables appear skewed to the right? Fixed Acidity, Total Sulfur Dioxide, pH, Alcohol
  • Based on scatterplots of quality against different feature variables, which of the following is most likely to have a positive impact on quality? Volatile Acidity, Residual Sugar, pH, Alcohol

Workspace

This section contains either a workspace (it can be a Jupyter Notebook workspace or an online code editor work space, etc.) and it cannot be automatically downloaded to be generated here. Please access the classroom with your account and manually download the workspace to your local machine. Note that for some courses, Udacity upload the workspace files onto https://github.com/udacity , so you may be able to download them there.

Workspace Information:

  • Default file path:
  • Workspace type: jupyter
  • Opened files (when workspace is loaded): n/a

Based on histograms of columns in this dataset, which feature variables appear skewed to the right?

SOLUTION:
  • Fixed Acidity
  • Alcohol

Based on scatterplots of quality against different feature variables, which of the following is most likely to have a positive impact on quality?

SOLUTION: Alcohol

Now that you've cleaned and learned more about your data, you can start drawing conclusions based on the questions from the quiz at the beginning of this lesson. In order to perform the analysis for these conclusions, let’s learn about two helpful functions in Pandas that you'll be using - groupby and query.